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(54) Computer apparatus and method for merging a sequential piurality of deHa streams 



(57) A conputer appaiatus (20) and metfKXi for 
m6rgir>g a sequential piurality of delta streams (30. 32. 
34), wtierein each detta stream (30. 32. 34) represents 
a change to either a prior delta stream (32. 34). an orig- 
inal data stream (36). or an updated data stream (52). 
The method and apparatus (20) may be used to 1) 
merge a sequenti^ plurality of delta streams (30. 32, 
34) with an original data stream (36) to create an 
updated data stream (52). 2) merge a sequential plural- 
ity of delta streams (30. 32. 34) to create a compiled 
delta stream (294). or 3) merge a sequential plurality of 
negative delta streams (54. 56. 58) to retrieve a desired 
prior data stream (284). The method arxi apparatus (20) 
may t>e used in conjunction with a oonputer t)ackup 
process, version manager, or the like. In summary, a 
consumer process (22) initiates a numi^ of. search 
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requests(102), within a transaction chain (38) corre- 
sponding to the sequential plurality of delta streams (30. 
32. 34). for a number of data t>ytes to transfer to an 
u|3dated data stream (52). TTie search requests (102) 
may be fulfilled (1 10) with data bytes provided by the 
last delta stream in the transaction chain (38) capable of 
supplying data t>ytes. or if tfie sequential plurality of 
delta streams (30, 32. 34) is incapable of fulfaiing (1 10) 
the search request (102). it may be futflled (1 12) with 
data t>ytes provided t)y the original data stream (36). As 
the sequential plurality off delta streams (30, 32, 34) is 
merged, a sequential pluraltty of negative delta streams 
(54. 56. 58) may be generated, thus enabling recon- 
struction of a desired prior data stream (284). 
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Descrfpllon 

Cross-Rflterence to Related Appiicalion 

This application hereby incorporates by reference s 
the U.S. patent application of Squbb, Serial Na 
08^)39702 fUed March 30. 1993. 



Reference to BUcrofiche Appendix 

This application also incorporates k>y reference the 
computer program listing contained in the attached 
microfiGhe appendix. The microfiche appencfix com- 
prises 1 microffiche with 30 frames. 

BacKqiPunti gf the Invwitjon 
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This invention pertains to a conputer apparatus 
and mettKxj for merging a sequential plurality of delta 
streams, wherein each deHa stream represents a 20 
change to either a prior delta stream or an original data 
stream. More specifically, ttie invention pertains to a 
method and apparatus for 1) mer^ng a sequential plu* 
rality of delta streams with an original data stream to 
create an updated data stream. 2) merging a sequential 25 
plurality of delta streams to aeate a compiled delta 
stream, or 3) aealing a negative delta stream or 
streams as one or more delta streams are merged wHh 
an original delta stream, or wi\h each other. The metfKxJ 
and apparatus may t>e used in cor^unction ¥vith a com- 30 
puter t)ackLp process, version manager, or the Gka 

The atxave-referenced application to Squilsb dis- 
closes a metfiod and apparatus for prochjckig a cfiange 
list for an original and updated version of a conputer 
file. The method and apparatus utilize a hash generator ss 
CP U to produce a token set for an original fie. A conpa- 
rator CPU later uses the token set and a windowing 
technique to identify and correlate tfie locations of coex- 
istent pages in the original and updated files. The com- 
parator CPU then uses the coexistent page information 40 
and the residue of the updated file (or original) to create 
a deHa expressing the differences t)etween the original 
and updated files. The delta is transmitted to another 
conputer and combined with a t>ackup copy of tfie orig- 
inal fOe to aeate a l)ackup copy of the updated f Ha The 45 
original fie and a series of deltas are used to retain fus- 
tork^al file information in a cost effective manner. 

Backup operatfons requiring the merging of deltas 
as in Squi)b are significantty more streamlined than 
those requiring the mer^ng of inaemental changes, so 
For example, one Ggabyte of rraxed fie types may have 
adaily change in .S%of its information, kxjtttiose same 
cfianges may be spread through 15% of its files. Adaily 
delta fie created using the Squibb method would repre- 
sent tfie .5% cfiange and have a size of roughly five ss 
Megabytes, whereas a daily inaemental change file 
would represent the 1 5% change in files and have a size 
of just over 150 Megabytes. Assuming a weekly full 
backup and daily inaemental backups, 78 Gigak>ytes of 



informatkxi is written in one year, and each inaemental 
t}ackup takes approximately three hours. Using daily 
delta backips. only two Gigabytes of information is writ- 
ten over the course of a year, arxt each t)ackup sesskxi 
takes approxinately five minutes. 

The time and space savings are e^en more accen- 
tuated with regard to a .5% change in a one Gigabyte 
database. In a datatiase, the .5% change can be spread 
throughout 100% of the database, and t>aGkup opera- 
tkxis using a daily delta file as described in Squibb 
result in even more signficant time, space and cost sav- 
ings. 

AHfiough the . above-refererxsed apparatus arxl 
method of Squibb possess unk:|ue advantages, one 
shortcoming is that the one by one merging of deltas 
with a backip copy of a file requires a significant 
amount of a backup repository's CPU time, especially 
when ttie backup repository is required to merge deftas 
with several thousaixJ files on a daPy t>as«s. 

It is therefore a primary object of this invention to 
provide a method and apparatus whfoh reduce the 
number of operatfons cam'ed out by a t)aGkip reposi- 
tor/sCPU. 

Although the applrcation of Squfob cfisdoses tfiat 
delta files may be saved by ttie t>ackip repository, it 
would be advantageous to also aeate a negative delta 
for each d^ and save the negative deltas in the 
backup repository. The storing of negative deltas would 
alfow a backip repository to merge one or more of the 
negative deltas with its current backup copy, thereftiy 
reconstructing a prenous version of a f la 

It is tfierefore an additkxial object of tills invention to 
provide a method and apparatus wNch aBow for the ae- 
ation of negative deltas, thus enabling tfie retrieval of 
earlier file versfons. 

Since a very large numt>er of delta f ies can be cum- 
t)ersome to manage, it would be desirable to periodi- 
cally reduce the numt>er of deHa files (espedaly oU 
delta files) wtien they become too rxjmerous. However, 
merely deleting every Nth deHa wouU destroy the nec- 
essary sequential relatXMiships between the deltas. 

It is therefore a further object of tNs invention to 
provide a method and apparatus wNch aBow the merg- 
ing of several deltas to aeate a oonpied delta, thereby 
reducing the management demands placed on a 
backup repository's CPU. 

The delta files (hereinafter referred to as a "delta 
streams") cfiscussed in tiie prior U.S. patent applicatfon 
of Squibb comprise tnformatfon denoting the matcfung 
and cfianged segments between 1) an original data 
stream (e.g.. a computer file) and an updated data 
stream (e.g.. a revised copy of the conputer file), or 2) 
two successive updates of a data stream. Stream referis 
to the fact that data has a sequentially relationship. Not 
all delta fOes (change files) have a sequential relatfon- 
ship. 

It is preferable to store sequential files on a sequen- 
tial media, txith for ease of access, and t>ecau5e 
sequential media is signifk:antly less expensive tfian 



2 



3 



EP0774 720A2 



4 



non-sequential (seekable) mecfia. An example of a 
sequential media is the magnetic tape, wtiile examples 
of seekable media include floppy disks, hard drives, and 
CD-ROMS. 

To date, a method and apparatus for accomplishing s 
the objectives noted atxsve has not t>een devetoped. 

One existing method of merging delta fles is the 
iterative buiM method. An iterative build system succes- 
sively merges a base fie with one a more delta fOes. 
Delta fOes cannot be simultaneously merged with a 10 
base file. The iterative build method does not reduce the 
input^output (IAO)operatk>ns of the CPU performing the 
merge. A base fie must be read and written once for 
every delta fie with ¥vhk:h it is merged. Thus, the 
number of I/O operations to merge N delta fles with a is 
base file is equal to: 

((N*2K1)*(number of bytes in the base fie) 

Iterative build systems only work with seekable media. 20 
and have therefore not been used for merging delta 
streams stored on sequential mecfia if an iterative buid 
system were to t>e used to merge delta streams stored 
on a sequential media, the delta streams woukl first 
fiave to be copied on to a seekable media, thereby fur- 2s 
ther inaeasing the number of I/O operations performed 
by a CPU. A system of aeating negative delta files 
using an iterative build ^stem is rwt known. 

A variation of ttie iterative build method is the itera- 
tive buid pipeline method. A base file is fed into an 30 
update process associated with a skigle delta fia The 
output of the update process is "piped* to another 
tpdate process, and eo oa until afinal updated file is 
produced. The iterative buiU pipeCne system was made 
possible by the advent of multi-tasking systems, and 35 
only works on a multi-tasking system. An advantage of 
the system is that I/O is reduced. The number of I/O 
operations to merge N change ffles with a base file is 
equal to: 

40 

2*(nuni>er of bytes in ttie base fie) 

However, several disadvantages remain. Rrst piping 
causes a contextual program switch each time a pipe 
buffer is fufl. For a large t>ase fie being merged witti 4S 
numerous defta fles (also possi)ly large), many contex- 
tual switcfies occur arxl the performance of the system 
suffers. Second, the iterative buid piping system only 
works in a multi-tasking environmenL Third, the method 
does not support the aeafion of negative delta fies. so 
Rnalty. ttie metfxxl once again requires delta files to be 
stored in a seekable meda. 

A final method of merging delta fies is disclosed in 
U.S. Patent #5.278.979 to Foster et af. Foster et al dis- 
cfoses a verston management system using pointers. 55 
Again, files to be merged must be copied to a seekable 
(non-sequential) media for ttie pointers to work. The 
patent discloses an ability to retrieve prevkxjs versk)ns 
by a backwards comparison of verstons. This is quite 



different from creating a reverse path during a forward 
merge (all in a single pass). Additional disadvantages of 
the Foster metixxi include 1) poor performance with 
lengttiy base and delta files (the method is directed to 
"storkig multiple versfons of program source data' 
wherein the verskxis typically involve less ttuui one 
Megabyte of datsQ, and 2) a contextual limitation of 
processing changes to lines" (data must obey contex- 
tual rules, and is not treated as a continuous stream) . 

It is therefore yet another ok)ject of this invention to 
overcome the above disadvantagesof other delta merg- 
ing, backup and versfon management systems. 

Summafy of ttie invention 

In the achievement of the foregoing ot)jects, Ihe 
inventors tiave devised a computer apparatus and 
metlKXl for merging a sequentiaf plurafity of delta 
streams. The method and apparatus may t>e used to 1) 
merge a sequential plurality of delta streams with an 
original data stream to aeate an updated data stream, 
2) merge a sequential plurafity ql delta streams to cre- 
ate a compiled delta stream, or 3) create a negative 
delta stream or streams as one or more delta streams 
are merged witii an original delta stream or each other. 
The method and apparatus may be used in conjurKtion 
with a computer backup process, versfon manager, or 
thelika 

In summary, to merge a sequential plurality of delta 
streams with an original data stream, a computer con- 
structs a chavi of transaction elements corresponding to 
ttie sequential plurality of delta streams. The towest 
numbered transaction element corresponds to a delta 
stream representing a first revfefon to the original data 
stream. arxJ sequentially numt>ered transaction ele- 
ments oorresporKJ to sequential revisfons of the original 
data stream A consumer process initiates a search 
request, wittiin the transaction cfiain, for a numtwr of 
data bytes to transfer to the ifxiated data stream. The 
search request is initiated with the trailing transaction 
element (that which represents the last revisfon to the 
original data stream). The search request may t>e 1) ful- 
f iled with data tjytes provfoed ts/f the highest numbered 
delta stream in ttie transaction chain capabfo of supply- 
ing data bytes, or 2) if ttie transaction diain is incapat)le 
of fulfilling the search request, it may be fulfilled witti 
data bytes provfoed by the original data stream. When a 
search request cannot be fuHHIed, ttie updated data 
stream is corrplete. 

Optionally, a prior stream position can t>e monitored 
for each delta sb^eam. This enables the computer appa- 
ratus in transferring a data frame comprised of data 
from one or mae prior streams of a particular delta 
stream, to a negative delta stream associated with ttie 
particular delta stream, when a source match position of 
ttie particular detta stream is ^^eater ttian ttie prior 
stream position of the particular delta stream. A source 
matoh position is associated with every match frame of 
a delta stream, and indicates how much of a prior 
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stream should have been consumed b/ the time a par- 
ticular match frame is loaded. If enough of a prior 
stream has not been consumed, it is due to a file dele- 
tion occuning b^een successive delta streams* and 
the deleted inlbmnation is recorded in a negative delta 5 
stream. Alter the transfer of a data frame to a negative 
delta stream, a match frame inversion is created in the 
negative delta stream. 

A variation of the method comprises beginning with 
an updated data stream and merging it with a sequential 10 
plurality of negative delta streams to create a desired 
prior data stream (possUy the original data stream). 
Another variation ooivvirises the merging of several 
delta streams to create a compiled delta stream 

An advantage of the newfy developed method and is 
apparatus is an ability to merge deHa files (advanta- 
geously represented delta streams) with a file (origi- 
nal data stream) stored in a sequential media, without 
having to first transfer the files to a seekable mecfia 

Ottier advantages include 1) an at>ility to aeate 20 
negative delta streans so that earlier versions of a f ae 
may be reconstructed, 2) an ability to merge a number 
of delta streams and create negative delta streams in a 
single pass, wherein the number of 1^ operations per- 
formed by a CPU is limited to: 2S 

2*(number of bytes in the base f Se). 

3) an afaifty to merge muHipfe delta streanrs to create a 
compiled delta stream. theret>y reducing the manage- so 
ment burdens placed on a CPU. 4) an abifity to work effi- 
ctentty in conputer environments which do not support 
niulti-tasking. and 5) an ^lity to efficiently manage the 
merging of large data and delta streams. 

While the disclosed metfrad and apparatus provide 3S 
specific advantages for use with sequential storage 
media, they are also effective for use with seekable 
media. 

It wiU be appreciated, after a review of the following 
description, that since delta streams are small with 40 
respect to an entire fie (roughly .5% of tiie entire file's 
size), multiple deltas can be kept locally, such as in the 
hard drive of a PC. in addHion to being kept at a central 
repository. By patching the discfosed apparatus into a 
fife system or an application program, delta streams can 45 
be generated in lieu of ovenwriting ties. Thus, a user 
may retain several version of a docunient locally, wittXHJt 
using extensive disk storage space. 

These and other important advantages and objec- 
tives of the present invention wNl be further explained in. so 
or will become apparent from, the accompanying 
description, drawing and claims. 

Brief Description of the Drawings 

55 

An illustrative and presentiy preferred embodiment 
of tiie invention is illustrated in the drawing in which: 

FIG. 1 is a schematic of an apparatus for merging a 



plurality of sequential delta streams witfi an originat 
data stream to aeate a revised data stream; 
Fta 2 shows a flow of operations which may be 
performed by tiie consumer process of FIG. 1; 
FIG. 3 shows a flow of operations which may be 
performed by a transaction element of FIG. 1 in 
reading a delta frame of a corresponcfing delta 
stream; 

FIG. 4 shows a f Im of operations which may be 
performed by a transaction element of FIG. 1 in 
loading a delta frame of a corresponding delta 
stream; 

FIG. 5 shows a flow of operations which may t>e 
performed by a transaction element of FIG. 1 in 
acfvandng the stream position of a prior delta 
stream; 

Fia 6 shows a flow of operations which may be 
performed by a transaction element of FIG. 1 in 
transferring data from an original data stream to a 
negative delta stream; 

FIG. 7 shews a flow of operations which may t>e 
performed by a transaction element of FIG. 1 in cre- 
ating an inverse match frame for a negative delta 

stream; 

FIG. 8 shows a flow of operations which may be 
performed b/ a tiansaction element which has 
loaded tfie final frame of its corresponding delta 
stream; 

FIG. 9 shows ttwee reviskxis to a simple sentence, 
ttiree delta streams representing the revisions, and 
ttiree negative delta streams wftich could have 
been generated tiyttie apparatus of FIG. 1; 
RG. 10 is a schematic of an apparatus for merging 
a sequential plurality of negative delta streams witti 
an ifxfated data stream to reconstruct a desired 
prior data stream; 

Fia 11 isaschematicof an apparatus for merging 
a sequential plurafity of delta streams to create a 
oonpiled delta stream; 

FIG. 12 shows a flow of operations whfoh may be 
performed by the compiler consumer process of 
FIG. 11; 

FIG. 13 shows a flew of operations whfoh may be 
performed by a transaction element of FIG 11 in 
constructing a match frame in the compiled delta 
sb'eam;and 

FIG. 14 shows ttie compiled delta stream and ttiree 
negative delta streams aeated by ttie apparatus of 
FIG. 11 during a merge of ttie ttiree delta streams 
off FIG. 9. 

Description of the Preferred Embodiment 

A schematic of a computer apparatus 20 for merg- 
ing a sequential plurality of delta streams 30. 32. 34 witti 
an original data stream 36 is shown generally in FIG. 1 . 
which may comprise a transaction chain 38 comprising 
a plurality of sequenced tiansaction elements 24. 26. 
28, wherein each transaction element 24, 26, 28 has a 
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delta stream input 42, 44, 46 oorrespording to one of 
the sequential pluratrty of delta streams 30, 32. 34; and 
a consumer process 22. connected to a traSing element 
24 of the transaction chain 38, and comprising an input 
48 for the original data stream 38 and an output 50 fx 5 
the updated data stream 52. 

A metfxxj for merging a sequential plurality of delta 
streams 30, 32. 34 with an original data stream 36 is 
shown generally in the flow charts of FIGS. 2-^, which 
may comprise the steps of initiating a search request 
102. wHhin the sequential plurality of delta streams 30, 
32. 34, for a number of data bytes to transfer to the 
ijpdated data stream 52; fuMling 110 the search 
request 102 with dala bytes provided by the last delta 
stream in the sequential plurality of delta streams 30, 
32, 34 which is capable of supplying data bytes; if the 
sequential plurality of delta streams 30. 32. 34 is inca* 
pakile of fulflfing 110 the search request 102, fuelling 
112 the search request 102 wHh data bytes provided by 
the original data stream 36; and repeating the above 
steps until a search request 102 cannot be fulfiied 110, 
1 1 2, and ttie updated data stream S2 is complete 1 06. 

Having thus described ttie apparatus 20 and 
method for merging a sequential plurality of delta 
streams 30, 32, 34 witti an original data stream 36 in 
general, ttie system will now be described in further 
detail. 

In a preferred embodiment, the transaction ele- 
ments 24, 26. 28 of ttie apparatus 20 each further com- 
prise a delta stream reader, a prior transaction element 
index, a prior stream position nxxiitor. a current frame 
buffer, and a search request 102 fuffllment \fO interface 
66.68.70. 

The consumer process 22 is responsS^le for 
requesting 102 data for the updated data stream 52 
from ttie dhaxn 38 of transaction elements (transaction 
chain). The prior transaction element index of a transac- 
tion element 24, 26, 28 is used in sequentiaBy propogat- 
ing ttie search requests 102 of the consumer process 
22 through ttie transaction chain 38. Each search 
request 102 rs iksi initiated 100 in ttie trailing element 
24 (TE2) of ttie transaction chain 38, or ttiat which cor- 
responds to a last revision 30 to the original data stream 
36. In response to the transaction elements' 24, 26. 28 
prior transaction element indices, search requests 102 
are propagated through progressively eariier revisions 
40 of ttie original data stream 36, until at last, a search 
request 102 reaches the transaction element 28 corre- 
sponding to a first revision to ttie original data stream 34 
(ttie 0-element or TEO). so 

The delta stream input 42, 44. 46 of a transaction 
element 24, 26, 28 is regulated so as to sequentially 
load 160 delta frames (match and/or data frames) from 
a corresponding delta stream 30, 32, 34 into the trans- 
action elemenTs current frame buffer 170. The steps 55 
performed in regulating a particular transaction ele- 
ment's 24. 26. 28 delta stream input 42, 44, 46 are listed 
in ttie flow chart of FIG. 4 and will t>e described in more 
detail later In this specif icatioa 



When a match frame is loaded 168 into a transac- 
tion element's 24, 26, 28 cunrent frame buffer 170, ttie 
source match position associated witti the frame is com- 
pared to ttie prior stream position monitored by ttie 
transaction element 24, 26, 28. A prior stream position 
monitor of a transaction element 24, 26. 28 indicates 
how many bytes of ttie transaction efemenTs 24, 26. 28 
prior stream 32, 34, 36 have been consumed. The prior 
stream 32, 34, 36 of a transaction element 24, 26, 28 is 
identified by the transaction elemenTs 24, 26. 28 prior 
transaction element index. A source match posrtion is a 
nunrtier included within a natch frame which indicates 
hew many bytes of a prior stream 40 shoiAt have been 
consumed by ttie time a particular match frame is 
loaded into its transaction element 24, 26. 28. If ttie 
source match position is greater ttian ttie prior stream 
posrtion 172. a second search request 196 is initiated so 
as to advance the stream position of the prior stream 
32, 34, 36 by a number of tyytes equal to ttie dffierence 
between the source match position and the prior stream 
posrtion 172 (ttie prior stream lag count). The steps of 
ttiis process are outtined in the flow chart of FIG 5. 

The delta stream readers are responsMe for read- 
ing 130 the delta frames (match andAor data frames) of 
a delta stream 30. 32. 34 once ttiey have been loaded 
120 into a transaction elemenfs 24. 26. 28 cunrent 
frame buffer. A data frame may be used to fulfill 136 a 
search request 102 initiated by ttie consumer process 
22 (via ttie search fulfillment 1^ interfaces 66, 68. 70). 
A match frame causes the search request 102 to be 
passed 146 to a prior transaction element 26, 28, or 
when a match frame resides in ttie current frame buffer 
of ttie last transaction element 28 in the ttansaction 
chain 38, the match frame causes the consumer proc- 
ess 22 to fulfill 112 its request 102 witti data from ttie 
original data stream 36. When a search request 102 is 
passed 146 to a prior transaction element 26. 28, ttie 
prior transaction element 26, 28 index of a transaction 
element 24, 26, 28 is used to determine to which trans- 
action element 26, 28 the search request 102 will be 
passed 146. The steps carried out by ttie delta frame 
reader are outlined In FIG. 3. 

An important addition to the preferred entxxfiment 
of ttie computer apparatus 20 is ttie inclusion of nega- 
tive delta stream writers 60, 62, 64 wHhtn each of ttie 
transaction elements 24, 26. 28. Wittiout the negative 
delta stream writers 60. 62, 64, the data bfies 1 72 of a 
prior stream 32. 34. 36 which are slqpped over during 
ttie advandng of a prior stream (FIG. 5) are lost. A neg- 
ative stream writer 60. 62, 64 is called into action by ttie 
advancing of a prior stream so that slqpped data t)ytes 
172 are written 204 (in the form of a data frame) into a 
negative delta stream 54, 56. 58 associated witti a par- 
ticular delta stream 30, 32, 34. After writing 204 a data 
frame to a negative delta stream 54, 56, 58. a match 
frame inversion of a newly loaded match frame is written 
1 78 to the same negative delta stream 54, 56. 58. In this 
manner, the negative delta streams 54. 56. 58 provide 
information for constructing a ttansaction chain 38 to 
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merge a sequentiaf plurality of negative delta streams 
54. 56, 58 with an updated delta stream 52 to retrieve a 
desired prior data stream 284 (whettier it be ttie origi nal 
data stream 36 or fie, or some intermediate revision of 
the ori^nat data stream). Flow charts for the processes s 
of transferring 204, 178 data franes and match frame 
inversions to a negative deHa stream 54. 56, 58 are 
shown in FIGSw 6and 7. respectively. 

While the apparatus 20 of FIG. 1 mayt>eentxxlied 
in a programmable computer, the apparatus 20 is not io 
limited to such an embodiment. The apparatus 20 may 
also t>e embodied in a physical storage media such as a 
floppy dsk, CD-ROM. tape, or the Ike. 

Preferred entodiments of the process steps and 
routines referred to ak)cve will rKiwk)edescri)ed in addi- is 
tional detail. 

Consumer PiDoess 

The consumer process 22 is responsfole lor build- 20 
ing the updated data stream 52. See FIG. 2. It repeat- 
edly requests 102 data from ihe transaction chain 38 
(chain of transaction elements). All requests 1 02 are ini- 
tiated 100 with the traOing element 24 (O-eiement) of the 
trarvaclion chain 38. The trailing element 24 is that 2s 
wfiich reads a defta stream 30 representing a last revi- 
sion to an original data stream 36, fie. database, or the 

A transaction element 24, 26, 28 can respond 1 44, 
136 to a request 102 of the consumer process 22 in one so 
of two ways. A transaction element 24, 26. 28 can fulfill 
110 a request 102 k>y returning 134 data from a current 
data frame (data from CURFRAMEBUF 170), or atter- 
nativeiy, by returning 144 a nurft>er representing a 
nurTt>erof data bytes which need to l>e transferred 112 3s 
from the original data stream 36 to the updated data 
stream 52. Any data received 108 by the consumer 
process 22 (through the transaction chain 38) is trans- 
ferred 1 1 0, 1 12 to the updated data stream 52. The con- 
sumer process 22 continues to initiate requests 102 for 40 
data until the transaction chain 38 is unatsie 104 to fulfill 
a request 1 02, and the consumer process 22 corx:iudes 
that the updated data stream has t^een completed 106. 

A preferred flow of operations for the CONSUMER 
PROCESS of FIG. 1 is shown in FIG. 2. The transaction 45 
context index (TCI) is used to identify 100 the transac- 
tion element 24 to wNch a request 102 will be made. As 
far as the consumer process 22 is corx:emed, the 
request 102 will always be made to ttie trailing element 
24. Therefore, the consumer process 22 sets 100 the so 
transaction context index to one less than the transac- 
tion count (TRANSACT10NCMT). or the total nurTt>er of 
transaction elements 24. 26, 28 in the transaction cfiain 
38. hfoto that the transaction element 24 servicing the 
detta stream 30 representing the first revision to the & 
original data stream 36 is designated as the 0-element. 

GOTCNT represents the greatest nunrtoer of data 
t}ytes which can be transferred 1 1 0, 1 1 2 to the updated 
data stream 52 following a single request 102 of the 



oor^mer process 22. If date is returned 134 by a trans- 
action element 24, 26. 28, GOTCNT will represent the 
total nunober of date bytes returned 136. The date bytes 
will be returned 134 via a return buffer (RETURNBUF). 
If the transaction cfiain 38 cannot supply date from its 
current frames, but date still remains in tfie original date 
stream 36, G0TCf4T will represent the nunto of date 
t)ytes to t>e trartsfened 112 from the original date 
stream 36 to the updated date stream 52. When the 
consumer process 22 initiates a search request 102. 
GOTCMT is set to OxFFFFFFFF Thus, the consumer 
process 22 is asking ttie transaction chain 38 for as 
many tiytes as can possfoly be returned 136, 144. 

Reading a DeHa Frame 

The frame reading process may k>e called tiy any 
transaction element 24, 26, 28. and in various oonteocts. 
For example, t>ytes may be read from ttie current frame 
of a transaction element 24, 26, 28 In response to a 
search request 102 of tfie consumer process 22, or a 
search request 196 of anotfier transaction element 24, 
26, 28 wNch is attempting to advance ite prior stream 
position. 

A preferred flow for tfie frame reading process, 
called READ FRAME, is shown in FIG. 5. 

When tfie consuner process 22 initiates a request 
102 for the largest rumber of date tiytes the trarisaction 
diain 38 can 8^)piy, its request 102 is first passed 114 
to the trailing element 24 of the transaction ctiain 38. 
The trailing element 24 will initiate ttie delte frame (date 
and/or match) reading process. See FIG. 3. If all of ttie 
bytes 116 represented by a transaction elemenfs 24, 
26, 28 current frame have t>een used 118. a new frame 
will be loaded 120 into the current frame t)ufrer 
(CURFRAMEBUF) of the transaction element 24, 26, 
28. If the frame loadng process returns an end-cf-file 
(EOF) condition 122. a GOTCf^TT of 0 is returned 124 to 
the corisumer process 22, and the biiUing of a particu- 
lar updated date stream is complete 106. Otherwise, ttie 
loading of a new frame causes tlie nvrrber of unused 
bytes In a current frame (UNU) to be reset 126 to the 
newly loaded frame's length (CfL), 

Once it is assured that a cunent frame lias been 
loaded, and ttiat the bytes of a frame have not been 
depleted 118. GETCNT (a local variable of the Read 
Frame process used to update the GOFCfMT request 
variable) is set 128 to the minimum of tfie number of 
t)ytes requested 1 02 in a search, or the nunfoer of tiytes 
available 116. 126 in a current frame. If the current 
frame is a date frame 130. a GETCNT number of date 
bytes will be copied 134 into RETURNBUF and 
returned 136 to the consumer process 22. If the current 
frame is not a date frame 130, the consumer process* 
22 request 102 will be passed 146 to a prior fransaction 
element 26, and so on 28. until either 1) date which can 
t>e returned 134 to the consumer process 22 is found 
within a current frame of a fransaction element 24. 26, 
28. or 2) it is determined 138 that none of the fransac- 



6 



11 



EP0774720A2 



12 



tion elements 24, 26. 28 can supphr date, and a positive that an tfxlated date stream 52 is complete 106, a 
GETCNT numt>ef is returned 144 to ttie consumer proc- DELTA TERMINATE process 164 insures that each and 
ess 22, indicating that date must be transferred 1 12 every transaction element 24, 26, 28 continues to read 
from the original date stream 36 to the updated date its delte stream 30, 32. 34 into an EOF frame is 
stream 52. When the consumer process' 22 request s reached 162. In this manner, negative delte streams 54, 
1 02 is transfen-ed to a prior transaction element 26. 28, 56, 58 are not left in an unfinished slata 
that element 26. 28 executes 148 the same defta frame If a ne«^ loaded frame is a date frame 168. a trans- 
reading process that was executed by the tailing ele- action elemenfs 24. 26, 28 cunent frame buffer 
ment24. However, each time the frame reacting process (CURFRAMEBUF) is loaded 170 with the frame'fe date 
is initiated by a transaction element 24, 26, 28. a unique w bytes. 

copy of the local variables associated with the process If a match frame is loaded 168. a source match 

are stored within a fransaction context of a particular position (SMP) associated with the match frame is com- 

fransacticn element 24, 26, 28. A fransaction contexts pared 1 72 to the loading fransaction elemenTs 24, 26. 

variables include: TCI. CFU UNU. CPU & PSP. 28 PSP. The source match position indicates how many 

TCI has afready been defined as the transaction is bytes should have already been consumed from a 

context index. CFU represents the number of used stream 32, 34. 36 associated with an imnr>ediately prior 

bytes (characters) In a current frame. The CFU of every fransaction element 26. 28 by the time the frame asso^ 

fransaction element fr> which a search request is elated with the particular SMP is loaded. ITierefore. if a 

passed is incremented 132. 154 by the GETCNT or match frame*s SMP exceeds its fransaction elemenTs 

GOTnunnberrelumed136. 158. 20 PSP. a number of bytes representing the difference 172 

PSP represents the prior sfream position assod- (PSLC. or prior sfream lag count) wOl need to be coll- 
ated with a fransaction element 24, 26, 28, and is an sumed 1 74 from the prior sfream 32, 34, 36 before pro- 
Inrporlant variable in advavxang a prior sfream'^ position oeeding with the cunent search request 102 of the 
(this task wfil be described in greater detail In a tater consumer process 22. 

section). However, note that the PSP of every fransac- 2S After a prior sfream has been advanced 174, and a 

tion element 24, 26. 28 to which a search request is number of date bytes representing the PSLC (date 

passed, butforthe PSP of a transaction element24, 26, deleted between consecutive revisions of the original 

28 fulfining 136 a search request 102, is advanced 156 date sfream) have been fransferred 204. 194,202toa 

by the number of bytes (GETClsfT or GOT) retumed to particular negative delte sfream 54, 56. 58. a routine is 

the consumer process 22 or other requesting entity 24, 30 called 176 which inverts the newly loaded match frame 

26.28. and causes the match frame inversion to be written 178 

Other variables referred to 152 In the READ into a particuter fransaction element's 24. 26. 28 nega- 

FRAME flow are: F2_BEG. F2_END and CF.F2„BEa tive deHa sfream 54. 56, 58. A match frame inversion is 

F2_BEG and F2.END represent the beginning and a frame which indicates a reverse mafr:hconcfition(i a. 

ending addresses of the updated date sfream refer- 35 which date bytes in an updated data sfream 52 match 

enced by a current match frame, and CFF2_BEQ repre- the bytes of a prior date sfream 284). 

sente the next address to be filled in the updated date Regardless of whether a date or mafc;h frame is 

sfream52. These variables have additional significance loaded 120 as the new cunrent frame of a fransaction 

when creating 1 76 a match frame inversion to write 1 78 element 24. 26, 28. the nunt)er of used bytes in the cur- 

to a negative deHa sfream 54, 56. 58, or when con- 4o rent frame (CFU) is reset 180 to zera and the cunent 

sfructBig 1 40 a match frame to write to a compiled delte frame's length (CFL) is set 1 82 to represent the nunt>er 

sfream 294. of bytes referenced by the new frame. 



Loading a DeHa Fiame 

4S 

When the number of unused bytes (UNU) in a 
fransaction eiemenfs 24. 26. 28 cunrent delta frame is 
determined 1 1 8 to be zero, a new frame must be loaded 
120 as ttie fransaction elemenTs 24. 26. 28 current 
frame. The process of loading a new detta frame is so 
revealed in the flow chart of FIG. 4. 

AlttKXjgh not discussed elsewhere in this applica- 
tion, each delte frame in a detta sfream's 30. 32, 34 
detta frame sequence is preceded by a frame header 
(or attematively. the header intonnation may be 55 
Included within the frame). The frame header is read 
160 to determine whether a match frame, data frame or 
EOF frame has been encountered. If an EOF frame is 
loaded 162. signifying 166 to the consumer process 22 



Advancing ttie Number of Bytes (consumed by a 
Prior Stream 

Whenever the loading 120 of a match frame creates 
172 a posHive PSLC. the number of Kyytes consumed 
from a prior transaction elenrienrs 26, 28 detta sfream 
32. 34 must be advanced by the PSIjC. The ADVANCE 
PRIOR STREAM flow of FIG. 5 shows the steps 
involved in advancing a fransaction element's 26, 28 
prior sfream 32, 34. while the steps involved in advanc- 
ing 1 94, 202 the number of bytes consumed by the orig- 
inal date sfream 36 are shown in the TRANSFER BASE 
TO NEGATIVE DELTA flow chart (FIG. 6), 

H a posttive PSLC remains 1 86, and a prior fransac- 
tion element 26. 28 existe 192, a second search request 
196 is initiated while the search request 102 of the con- 
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sumer process 22 is held in abeyance. The number of 
data bytes requested 196 in the second search request 
Is defined by the SKIPCNT variable, which is inrtiaily 
equal to PSLC. The second search request 196 neces- 
sitates the running of adcftionat instances of the FKB. 3 5 
frame reading process. tMe tfiat the frame reading 
process may call itself 148. as part of the second search 
request 196, in an attempt to fulfil 136, 144 a request 
196 for a SKIPCNT number of data bytes. Data is 
retumed 200 to the ADVANCE PRIOR STREAM flow 10 
via the RETURNBUR If data is returned 200. it is written 
204 to a particular negative delta stream 54. 56. 58 in 
the form of a data frame. If a positive GOTCNT number 
is returned, but RETURNBUF is empty, the TRANSFER 
BASE TO NEQAnVE DELTA routine 202 is used to is 
transfer 21 2 data from the origral data stream 36 to ttie 
particultf negative delta stream 54. 56. 58. If a second 
search request 196 is only partiaOy fulfSled 204 by a 
QOTCNT number of data bytes. SKIPCNT is reduced 
206 by the QOTCNT number, and the steps of the prior 20 
stream advancement flow are repeated. 

The prior stream position of every transaction ele- 
ment 24, 26, 28 having a position number in the trans- 
action chain 38 which is less than or equal to the 
number of the transaction element 24, 26, 28 initiating a 2S 
second search request 196. but greater than the 
nunt>er of the transaction element 24. 26. 28 (or stream 
36) fUlfifing the request 196, is increased 188, 208 by 
the number of bytes transferred 204. 194, 202 to a par- 
ticular negative delta stream 54. 56. 58. 30 

Closely related to the ADVANCE PRIOR STREAM 
routine is the TRANSFER BASE TO NEGATIVE DELTA 
routine. As eariy stated, the second routine is called 
when the elements 24. 26. 28 of a transaction chain 38 
prk)r40toaparticttoelement24.26.28areunableto ss 
fulfin a request 174. 196 lor a PSLC (SKIPCNT) number 
of data bytes. The TRANSFER BASE TO NEGATIVE 
DELTA routine may seem like overfdil with respect to 
merging a sequential plurality of delta streams 30, 32. 
34 with an original data stream 36. However, when 40 
merging a sequential plurality of delta streams 30, 32. 
34 to create a compiled delta stream 294. the original 
data stream 36 may not be available. It is therefore nec- 
essary to include a token 210, 212 in a negative delta 
stream 288. 290. 292 whk;h references the missing 4S 
original data stream 36. If the ori^'nal data stream 36 is 
present data bytes from the ori^nal data stream 36 
may be used to fiD 214 the Base (originaQ Stream Ref- 
erence Tokea PASSCNT is the k)cal variable of the 
TRANSFER BASE TO NEGAHVE DELTA ftow which so 
initially equals either the PSLC or SKIPChTT variable of 
its calling routine. 

Creating a Match Frame Inyersion 

55 

FIG. 7 shows the steps involved in creating an 
inverse match frame or match frame inversion. The 
steps involve a simple algoritiimic inversion 218. 
whereby a frame indicating ^ segment in a delta stream 



30. 32. 34 whk:h matches a segment in a prior delta 
stream 32. 34. 36 representing a prior reviskxi to an 
original data stream 36 is Inverted' so that the inverse 
match frame recorded 178 in a negative delta stream 
54. 56, 58 indicates a segment in a negative delta 
stream 54, 56, 58 whk^h matches a segment in a nega- 
tive delta stream 54. 56. 58 representing a prior revision 
to an updated data stream 52. 

Insuring that Each TyansactJon Element has Read 
All of the Firames in Hs Conresponding Delia Stream 

By the time an end-of-file (EOF) frame has been 
reached 162 by a transaction elenrient 24, 26, 28. the 
updated data stream 52 has been completed 1 06. How- 
ever, ttie final frames of one or more delta streams 30. 
32. 34 may nc/t have t>een read. In order to insure ttiat 
an negative deKa streams 54. 56. 58 are complete, a 
DELTA TEra^lNATE piocess is iretiated (FKl 8). 

An EOF frame will include a SMP relerencffig tfie 
last byte in the prior transaction elemenfs 56. 58 delta 
stream 32, 34. A prior stream lag count (PSLC) is calcu- 
lated 222 as the difference tietween tfie EOF framed 
SMP and the PSP of ttie transaction element 24, 26, 28 
loading the EOF frame. Prior streams 40 are advanced 
224 by calfing tiie ADVANCE PRIOR STREAM routine 
of FKa. 5 using ttie PSLC as ttie number of data bytes 
left to read in a prior stream 32.34,36. As EOF frarhe*8 
are encountered 228 in prior streams 40, additional 
instances of ttie DELTA TERMINATE routine wiU be am, 
thus causing a ripple effect wtiich insures 232 ttiat all 
frames of aH delta streams have been read. 

Adcfitional understanding of the mettiod and appa- 
ratus 20 for mer^ng a sequential plurality of delta 
streams 24. 26. 28 witti an aiginal data stream 36 may 
t>e derived from the exanple of FIG. 9. The figure shows 
three revisions 240. 242. 244 to the simple text file 238. 
'THISJS_AjrEST_PAGE.- Following ttie ttiree revi- 
sions 240, 242, 244 are ttiree delta streams aeated by 
application of the mettiod descrit>ed in U.S. Patent 
a|3plication Serial No. 08/039.702 of Squbbi The data 
frames comprise added data (an ac{ustment). The 
match frames conprise a source match position (earty 
fie index), followed tyy the beginning (updated fie offset) 
and ending bytes of a se^nent in ttie revised fie which 
matches a segment in ttie prior file. 

Using ttie ttiree delta streams, ttie original text file 
238. and the apparatus 20 and method descr toed above 
for merging them, the final revised text file 244 of FIG. 9 
may be constructed. 

The merge begins as a transaction chain 38 is con- 
structed, and all global and transaction context varia- 
bles are initialized. The ttiree revisions 240. 242. 244 of 
FKx 9, represented tiy delta streams 0. 1 and 2. corre- 
spond witti transaction elements 0. 1 and2of FIG. 1. 

The consumer process 22 requests 102 as many 
data bytes as the transaction ctiain 38 can supply, and 
initiates its search 102 in ttie trailing transaction element 
24 (TE2). Since ttie current frame of TE2 is undefined. 
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the first frame 258 of the corresponding delta stream 30 
is loaded 120 as the current frame of TE2. In loading the 
frame 258. an inversion of tf>e match frame is recorded 
1 78 in the negative delta stream 54 of TE2. Prior stream 
advancement does not occur since the source match $ 
position (SMP) of tfte loaded match frame 258 is zera 
The rriatch frame 258 comprises tenisytBS. thus causing 
a search request 102 for ten data bytes to be passed 
146 to TE1. TE1 loads 120 its first frame 252. and 
records 178 an irwefsion of its match frame in As oorre- io 
sponcf ng negative delta stream 56. Once again, the 
SMP of the nem frame is zero, and prior stream 
advancement does not occur. TTie newly loaded frame 
252ofTE1 is a match frame of five bytes, so the search 
request 102 is modified 128 again (to five data bytes) is 
before bekig passed 146 to TEO. Finally. TEO loads 120 
its first delta frame 246. a match frame 246 of ten t>yte& 
An inverse of the match frame is recofded 1 78 in TEO'b 
negative defta stream 58, but prior stream advancement 
does not occur. Since TEO's current fraine length of ten 20 
bytes is greater tfian the requested number of data 
t>ytes. TEO does not mcxSSty the search request 102. 
However, since rt has no data to supply, it retoms 144 
tfie mocfified search request 102 for five data tiytes to 
ttie consumer process 22 by propogating tfie number 2s 
five back through tfie transaction cfiaki 38. As the 
number five is passed back 144. 158 to each transac- 
tion element 26. 24. the prior stream position (PSP) of 
each transaction element 26. 24 is incremented 156 by 
five. Upon receiving its unfulfiDed search request 102. 30 
ttie consumer process 22 transfers 1 1 2 the first five data 
bytes of tfie original text file 238 (THISJ to the updated 
text file 52. 

Another search request 102 is now initiated kyy tfie 
consumer process 22. Since the current frame of is 3S 
a match frame 258 witti five unused t>ytes. tfie search 
request 102 is mocfified 128 and passed 146 along. 
TE1 . having depleted tiie five bfies of its current match 
frame 252. loads 120 the second delta frame 254 of its 
delta stream 32. The second frame 254 is 8 data frame. 4o 
and the search request 102 is fulfilled 134, 136 with five 
data bytes (COULD). As the data s returned through 
the transaction chain 38, the PSP of TE2 rises from five 
to ten. 

During the tfiird request 1 02 of the consumer proc- 45 
ess 22. TE2 must load 120 a new della frame. Since ttie 
new trame 260 is a data frame, its entire contents LB) 
are returned 134 to ttie consumer process 22. The 
updated text f ye 52 now reads. THIS.COULD.B*. 

A next request 102 of ttie consumer process 22 so 
causes TE2 to load ttie ttiird frame 262 of Hs delta 
stream 30. Since ttie frame 262 is a match trame, ttie 
SMP of ttie frame 262 is oorvpared to TE2*s PSP and it 
is determined 172 ttiat the stream 32 prior to TE2 needs 
to be advanced bf five bytes (15-10=5). A second ss 
search request 196. for five bytes, is initiated in the 
streams 40 prior to TE2. Since the current data frame 
254 of TE1 can supply ttie entire five bytes requested in 
the second search 196, and fulfill ttie necessary 



advancement of TE2's PSP. ttie second search request 
196 is terminated 190. The five bytes returned by TE1 
(NT_B) are written 204 into ttie negative delta stream 
54 of TE2. Having advanced its PSP to fifteen. TE2 
records 1 78 ttie inverse of Hs newly loaded match frame 
262 in its negative delta stream 54, reduces the con- 
sumer process' requested number of bytes to fifteen 
(ttie lengtti of its current match frame), and passes 1 46 
ttie request 102 to TE1. TE1 can supply four data bytes 
(E_AJ from its current data frame 254. As the data is 
returned 136 to the consumer process 22. T^s PSP is 
incremented 1 56 to nineteen. 

The next request 1 02 of the consumer process 22 
is passed 146 ttirough TE2. being rec&jced 128 to 
eleven. TE1 must noM load 120 a new delta frame. 
Since its next available frame is a match frame 256. it 
detenranes ttiat the SMP of the new frame 256 is 
greater ttian its PSP (10-5s5). and the prior stream 
must be ad^^nced 1 74 by five bytes. Since TEO has five 
bytes left in its match frame 246. ttie next five bytes d 
ttie original text file 238 (IS.AJ must be transfen-ed 194 
to the negative delta stream 56 of TE1. In returning ttie 
data ttirough ttie transaction chain 38, the PSP of TEO 
is inaemented 156 to ten. and the PSP of TH1 is also 
incremented 156 to ten. An inverse of TEVs new match 
frame 256 is recorded 178 in its negative delta stream 
56. Since the newly loaded matcfi frame 256 of TE1 is 
longer ttian eleven bytes, ttie consumer process' 22 
search request 102 is passed 146. unmodified, to TEO. 
TEO is required to load 120 the next frame 248 of Hs 
deHa stream 34. The frame 248 is a data frame of eight 
bytes, so ttie search request 102 is modified 128, and 
ei^ data bytes (RADICALJ are retumed 136 ttirough 
tfie transaction chain 38. The PSP of TE1 is incre- 
mented 156 to eighteen, and ttie PSP of TE2 is incre- 
mented 156 to twentyseven. The updated text file 52 
being aeated by the consumer process 22 now reads, 
•TrHIS_CX)ULD_BE_A_RADICAL", 

An adcfitional request 102 of the consumer process 
22 is reduced 128 to a request 102 fr)r ttYee data bytes 
as H IS passed 146 througti TE2. The request 102 frx 
ttiree data bytes is passed 146 ttirough TE1 wHhout 
modification. Upon reaching TEO, the search request 
102 is held in abeyance as TEO loads 120 ttie last frame 
250 of Hs deHa stream 34. Since ttie SMP of ttie loaded 
match frame 250 equals TEO's PSP (10=10), prior 
stream advancement does not occur. However, an 
inverse of the match frame is recorded 178 in TEO's 
negative deHa stream 58. The request 102 for tftree 
data bytes is returned 144 to tfie consumer process 22 
unfutfiiled. As H is returned. TEVs PSP rises to twenty- 
one, and TE2*s PSP rises to thirty. The next three bytes 
(TES) of ttie original text file 238 are transferred 112 to 
ttie updated text file 52. 

The next request 102 of the consumier process 22 
is fulfilled 134. 136 by ttie two t>ytes (T.) of a newly 
loaded data frame 264 of TE2. 

A final request 102 of ttie consumer process 22 is 
returned unfulf lied, having t>een reduced to zero when 
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TE2 acknovvledged reaching the end 122 of rts data 
stream 30. The consumer process 22 has therefore 
completed rts buicfing of the ifxiated text f fle 244. 
THIS.COULD_BE_A.RADlCALjrEST" 

When TE2 acknowledges 166 reaching the end of s 
its delta stream 30, a ripple effect is set into action which 
causes the other transaction elements 26. 28 to wind up 
224 the reading of their own delta streams 32. 34 and 
complete 232 the buildrig of their negative delta 
streanv 56, 58. In this process, the last seven charac- io 
ters of the origpnal text f fle 238 (T_RAGE.) are written to 
the negative delta stream 54 of TE2 as a data frame. 

It should be understood that while the above exanv 
pie initiated a mer^'ng of three delta streams 30. 32. 34 
with an original data stream 36. the above n^ethod and is 
apparatus 20 wil readily merge from one to many delta 
streams with an original data stream 36. The number of 
streams to be merged is only limited by the capabiKties 
ofagivenCPU. 

If it was desired to reverse the abcve merge proc- 20 
ess. and reconstruct the origrnat text fle 238 of FIG. 9. 
having only the updated text file 244 and the three neg- 
ative delta streams 54. 56. 58. the computer apparatus 
20 could construct the transaction chain 38 of FIGL 10. 
Note that the negative delta streams 54. 56, 58 are 2s 
linked to the transaction elements 24, 26, 28 in a 
reverse order, and thcrt negatives of the negative delta 
streams 54. 56. 58 are rK>t created (m reality, the nega- 
tive delta stream writere are linked to nufl devices). 
Stepping through the method of FIGS. 2-8. the ori^'nai 30 
text file 238 could be reconstructed. Likewise, a transac- 
tion chain comprising only negative delta streams 2 and 
1 could be used to reconstruct ttte first revision 240 (a 
desired prior data stream 284) to the original text fie 
238. 35 

The method and apparatus 20 descrbed atx3ve. 
with some maior modifications, can also t>e used to 
merge a sequential plurality of delta streams 30. 32.34 
to aeate a compiled delta stream 294. Rather than 
bar nJiBf ling derta bytes to an updated data stream 52. 40 
delta frames (data and^or match) are transferred 306. 
308 to a compiled delta stream 294. 

The modifications wfiich need to k>e made to the 
apparatus 20 of FIG. 1 include the sut)stitution of a com- 
piler consumer process 286 and compiled delta stream 45 
294 connection for the consumer process 22 and 
updated data stream 52 connection. These mocffica- 
tions are shown in the apparatus 334 of FIG. 11. Adcfi- 
tionafly. the original data stream 48 connections are 
removed. The only change in the method is a change in so 
tfie flow of operations within the compiler cor«umer 
process 286. as outlined below. 

Compiler Consumer Process 

55 

The compiler consumer process 286 rs responsfele 
for building the compiled delta stream 294, and is sirrvlar 
in most respects to the consumer process 22 which 
builds an ipdated data stream 52. Again, it repeatedy 



requests 298 data from the transaction element chain 
38, and all requests 298 are initiated 296 to the trailing 
element 24 of tf^ chain 38. The transaction elements 
24. 26. 28 respond to the compiler consumer process 
286 in tfie same manner tfiat they would respond to a 
request 102 from a ^andard consumer process 22. 
However, some slighl differences exist If data is 
returned 304 by a transaction element 24. 26. 28. it is 
copied 306 into the compOed delta stream 294 as a data 
frame. The data bytes of the data frame are not 
extracted. This is due to the fact ttiat a "delta stream' 
294 rather than a *data stream" 52 is being compiled. If 
the transaction cfiain 38 returns a positive GOTCNT 
number, data is not transferred from ttie original data 
stream 36. since a connection to that stream may not 
exist Instead of transfem'ng data from the original data 
stream 36. a match fiame which references the aiginal 
data stream 36 is written 308 in the oorrpiled delta 
stream 294. 

The compiler consumer process 286 continues to 
initiate requests 298 uiti! ttie transaction elements 24. 
26. 28 reach the end of their delta streams 30, 32, 34 
and are unable 302 to fulf 31 306 a request 298 or return 
308 a positive GOTCf^ number. 

A preferred flow of operations for the compiler con- 
sumer process 286 of FIG. 11 is shown in FIG. 12. The 
variables in ttte ffew cfiart of FIG. 12 are identical to 
those of FIG. 2 for tfie standard consumer process 22. 

The frame reading process (FIG. 3) called by the 
oorrpiler consumer process 286 is identical to the frame 
reacfing process previously 6escrbed. However, the 
result 312 of the Construct Match Frame process, which 
was previously ignored, is nm used wtien writing 308 a 
match frame to the oonrpOed delta stream 294. As pre- 
viously slated, in a compiled delta stream 294. a match 
frame is constructed 140 in Deu of transferring data from 
an original data stream 36 to the compiled delta stream 
294. The process of constructing 310 a match frame in 
a compfled delta stream 294 is described below. 

Constructing a Match Firame in a Compiled Delta 
Stream 

If a search request 298 of the compiler consumer 
process 286 is passed throu^ the transaction chain 38. 
and none of tfie transaction elemerrts 24. 26, 28 is capa- 
ble of returning data to the compiler consumer process 
286, amatch frame must t>econstructed3108Otfiat the 
compiled delta stream 294 wifl be able to instruct a con- 
sumer process 22 when to draw data from an original 
data stream 36. 

Aflow of operations for the match frame construc- 
tion process is shown in FIG. 13. The match frame con- 
struction process will always be called tiy TEO. the 
transaction element 28 at ttie end of ttie transaction 
chain 38. as shown in the Read Frame flow of FIG. 3. 
The Construct Match Frame routine utilizes a combina- 
tion of the transaction context variables of the transac- 
tion element 28 calling it. and also, ttie transaction 



10 



19 



EP0774 720A2 



20 



comext variables of the traiBng elemem 24 (TE2 in FIQ. 
11). Deftnitiofis for Fia 13 variables which are not 
definQd elsewhere In this specification, are as fbOows: 

SERVCNT: the nuni>er of characters to be repre- 
sented by the constructed match frame 

CF.F2_BEG: the beginning address of the current 
delta frame for the trailing transaction 
element 28 (TE2 in FIG. 1 1) 

Further uKlerBtanding of the method and appara- 
tus 334 for merging a sequential plurality of delta 
streanrs 30. 32, 34 to create a compiled deHa stream 
294 may be derived from the example of FIQ. 14. The 
example is based on ttie three revisions 240, 242. 244 
to the simple text fBe 238 of FIG. 9. The three delta 
streams 30. 32. 34 created by application of the method 
desabed in US. Potent appfication Serial 1^ 
08A>39.702 of Squfeb are repeated in FIG. 14. Using the 
three delta streams 30. 32. 34. and the apparatus 334 
and method described above for merging them, the 
oompied delta stream 294 of FIG. 14 may t>e con- 
structed. 

Once agaia the merge begins with tie construction 
of a transaction chain 38. and bvtializalion of all gtotxd 
arvJ contextual variables. Delta streams 0. 1 and 2. cor- 
respond wHh the transaction elements 0. 1 and2of FIG. 
11. 

The compiler consumer process 286 requests 298 
as many data bytes as the transaction chain 38 can 
supply, and initiates 296 its search 298 in traiRng trans- 
action element TE2. As in the merge of delta streams 
30. 32. 34 with an original data stream 36. the first 
request 298 of the compiler consumer process 286 
causes each transaction element 24. 26, 28 in the chain 
38 to load Its first delta frame 246. 252, 258 and write 
178 a match frame inversion 266, 270, 276 in its corre- 
sponding negative deHa stream 288, 290, 292. When 
the search request 298 (having been nmfified 128 to a 
request 298 for five data t)ytes) reaches TEO and canrxit 
be fulfilled, a match frame 318 refereridng the original 
data stream 36 is constructed 140, and the modified 
search request 298 for five data t>ytes is propagated 
t>ack ttvough the transaction chain 38 to the compiler 
consumer process 286. As tfie nunrter five is passed 
back to each transaction element 24, 26. 28, ttie prior 
stream position (PSP) of each transaction element 24. 
26, 28 is incremented 156 bf five. Upon receiving its 
unfutflled 304 search request 298. the oompaer con- 
sumer process 286 records 308 ttie constructed match 
frame 318 in the corrpifed delta stream 294. 

Another search request 298 is now initiated by the 
compiler consumer process 286. Since the current 
frame of TE2 is a current match frame 258 with five 
unused t>ytes. the search request 298 is nrxxJified 128 
and passed 146 along. TE1, having depleted the five 
tyX^ of its curent match frame, loads the second defta 
frame 254 of its detta stream 32. The second frame 254 
is a data frame, and the search request 298 is fulf aied 



134. 136 with five bytes of data 320 (COULD). As the 
number of data bytes is returned through the transac- 
tion chain 38. the PSP of TE2 rises 156 from five to ten. 
During the third request 298 of the compiler con- 

5 sumer process 286. TE2 must load a new delta frame. 
Since the new frame 260 is a data frame, its entire con- 
tents LB) are returned 134 to the conpfler consumer 
process 286, and written 306 as a data frame 322 to the 
compiled deHa stream 294. 

10 A next request 298 of the compter consumer proc- 
ess 2^ causes TE2 to load the third frame 262 of its 
delta stream 30. Since the frame 262 is a match frame, 
the SMP of the frame is compared to TE2*s PSP and it 
is determined 1 72 ttiat tfie stream 32 prior to TE2 needs 

f5 to be acMmced by five bytes (15-10=5). A second 
search request 196, for five bytes, is rutiated in tfie 
streams 32. 34 prior to TE2. Since tfie current data 
firame 254 of TE1 can supply the entre five bytes 
requested in the second search 196» and fulfil tfie nec- 

20 essary advancement of TE2'^ PSP, the second search 
request 296 is terminated. The five bytes returned by 
TE1 (NT_B) are written 204 ffito the negative delta 
stream 288 of TE2 ffi the form of a data frame 278. Hav- 
mg advanced its PSP to 15. TE2 records 178 the 

25 inverse of its newly loaded match frame 262 in its nega- 
tive delta stream 288, redjces 128 the compiler con- 
sumer process' requested nurTt>er of tiytes to fifteen 
(the length of its current match frame 262), and passes 
146 the request 298 to TE1. TE1 can supply four bytes 

30 Of data (E_A J from its current data frame 254. As the 
data 324 is returned to ttie compiler consumer process 
286, TE2's PSP is inaemented 156 to nineteen. 

The next request 298 of the compiler consinner 
process 286 is passed 1 46 ttvough TE2, being reduced 

35 128 to eleven. TE1 must now load 120 a new delta 
frame. Since its next availat^le frame is a match frame 
256, it determines 172 that the SMP of the new frame 
256 is greater than its PSP (10-5^, and the prior 
stream 34 must be advanced t>y five bytes. Since TEO 

40 fias five tiytes left in its match frame 246, and cannot 
sif}ply da^ an origi nal data sfream 36 reference token 
must be recorded 212 in the negative delta stream 290 
of TE1. Upon merging the original data stream 36 with 
the compiled delta stream 294, the reference token 

45 recorded 212 in tfie negative delta stream 290 of TE1 
may be filed 214 with the appropriate data from the 
original data stream 36. In returning the number of 
bytes In the reference token through tfie fransaction 
cfiain 38, the PSP of TEO is incremented 1 56 to ten. and 

so thePSPofTEl is inaemented 156totea An inverse of 
TEVs new match frame 256 is recorded 178 in its neg- 
ative delta stream 290. Since the newly loaded match 
frame 256 of TE1 is kniger tfian eleven bytes, the com- 
piler consumer process' 286 search request 298 is 

55 passed 146, unmocfif ied. to TEO. TEO is required to k>ad 
120 the next frame 248 of its delta sfream 34. The frame 
248 is a data frame of eight bytes. The search request 
298 is modified 128, and eight data bytes (RADICALJ 
are returned through the transaction chain 38. The PSP 
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of TE1 is incremented 156 to eighteen, and Ihe PSP of 
TE2 is incremented 156 to twenty^sa^en. Ihe eight 
returned data t)ytes are recorded 306 in the compied 
delta stream 294 as a data frame 326. 

An additional request 298 of the compiler consumer 
process 286 is reduced 128 to a rec^est 298 for three 
data t>ytes as it is passed 1 46 through TE2. The request 
298 for three data t>ytes is passed 146 through TE1 
without modification. Ifpon reaching TEO, the search, 
request 298 is held in abeyance as TEO loads 120 the 
fast frame 250 of its delta stream 34. Since the SMP of 
the loaded match frame 250 equals TE0*s PSP (10=:10), 
prior stream advancement does not occur. An inverse of 
the loaded match frame 250 is recorded 178 in TEO*s 
negative delta stream 292. Once again, TEO must con- 
struct 140 a match frame refererKing the onginal data 
stream 36. Ihe request 298 for three data bytes is 
returned to the compiler consumer process 286 unful- 
filed. As it is returned, TElls PSP rises 156 to tuveni^- 
one, and TE2*s PSP rises 156 to thirty. The constructed 
match frame 328 is recorded 308 in the compiled delta 
stream 294, 

The next request 298 of the compiler consumer 
process 286 is fulfilled with a data frame 330 represent- 
ing the two bytes (T.) of a newly loaded data frame 264 
ofTE2. 

A f inai request 298 of the compi{er consumer proc- 
ess 286 is returned unfulfilled, having been reduced to 
zero wfien TE2 ackncwledged 122 reacNng ttie end of 
Hs deHa stream 30. Ttie oompier consumer process 
286 has therefore compleled 302 its build of the oom- 
pOed delta stream 294. 

As in the merge of delta streams 30. 32, 34 with an 
original data stream 36. when TE2 acknowledges 122 
reaching the end of its delta stream 30. a ripple effect 
164 is set into action which causes the other transaction 
elements 26. 28 to wind up 224 tfie reading of their own 
delta streams 32. 34 and complete the buBdng of their 
negative delta streams 290, 292. During this process, 
another reference token 316. this time one referencing 
the last seven kiytes of the original data stream 36, is 
reooiXied 204 in tfie negative delta stream 288 of TE2. 

Although the delta streams 30. 32. 34 discussed 
herein have comprised delta frames refererx^ing match- 
ing andA)r added data t3ytes. one skilled in the art oouM 
readily use the discfc>sed method and apparatus 20. 
332, 334 n corijunction with delta streams comprising 
refererK:es to matching and/or added data Ones. The 
cfiaracters of a match frame would tfierefbre refer to line 
addresses rather than byte addresses. 

The akX3ve description has descrit)ed a mettiod and 
apparatus 20. 332. 334 for nrierging a sec^ential plural- 
ity of deHa streams 30, 32. 34. Since deitei streams 30. 
32. 34 comprise a sequence of non-overlapping match 
and/br data frames, the system yieMs exceptional 
results wtien all streams 30, 32, 34 are stored in (and 
vmritten to) sequential storage media. No other delU 
merging system is known to operate in an environment 
comprising ooly sequential storage media. 



While illustrative and presently preferred emtxxii- 
ments of the method and apparatus 20. 332. 334 have 
been described in detail herein, it is to be understood 
that the disclosed concepts may be otherwise variously 
5 emtxxiied and empfoyed and that the appended claims 
are intended to be construed to uidude such variatfons 
except insofor as limited by the prior art 

Claims 

10 

1. In a programmable computer, a method of merging 
an original data stream (36) with a sequential plu- 
rality of delta streams (30. 32. 34) to buBd an 
ifxiated data stream (52), the method comprising 

IS tfie steps of: 

a) initiating a search request (102). within the 
sequerttial plurality of delta streanns (30. 32, 
34), for anumber of data bytes to transfer to the 

20 updated data stream (52); 

b) fulfiBing (110) the search request (102) with 
data t)ytes provkled by the last delta stream oi 
tfie secpential plurality of delta streams (30. 32. 
34) wtuch is capable of supplying data t)ytes; 

2s c) if the sequential plurality of delta streams 

(30. 32. 34) is incapable of fulfiDing (110) tfie 
search request (102). fulfflling (112) the search 
request (102) witfi data bytes provided t)y tfie 
original data stream (36); and 

30 d) repeatnig steps a)-c) until a search request 

(102) cannot be fiilfOled (110. 112). and tfie 
updated data stream (52) is complete (1 06). 

2. In a programmat)le computer, a metfxxJ of merging 
3S an original data stream (36) wrtfi a sequential plu- 
rality of delta streams (30. 32, 34) to buikJ an 
tpdated data stream (52). tfie mettiod comprising 
the steps of: 

40 a) constructing a chain ^) of transactfon ele- 

ments (24. 26. 28) corresponding to tfie 
sequential plurality of deHa streams (30. 32. 
34). wfierein a highest nuni^ered transactfon 
element (24) in tfie transactfon cfiain (38) is 

45 associated witti a deHa stream (30) represent- 

ing a latest revisfon to the original data stream 
(36), a kswest numbered transactfon dement 
(28) In tfie transactfon chain (38) is associated 
wHh a deHa stream (34) representing a first 

so revision to the original data stream (36), and 

consecutively nunt)ered transactfon elements 
(28. 26. 24) in the transaction chain (38) are 
associated wHh sequential revisfons to tfie orig- 
inal data stream (36); 

ss b) initiating a search request (102). wHhin tfie 

transaction chain (38), for a number of data 
bytes to transfer to the updated data stream 
(52); 

c) fulfaitng (1 10) tfie search request (102) witti 
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data bytes provided by the highest numbered 
transaction element in the transaction chain 
(38) capable of supplying data bytes; 

d) if the transaction elements (24, 26, 28) of the 
transaction chain (38) are incapable of fuffilling s 
(110) the search request (102), futfilling (112) 
the search request (102) with data tyytes pro- 
vided by the orignal data stream (3Q; and 

e) repeating steps ayd) until a search request 
(102) cannot be fulfilled (110, 112). and the io 
updated dbata stream (52) is complete (106). 

3. A mettxxf as in daim 2. further comprising the step 
of reading the original data stream (36) directly 
from a sequential media. is 

4. In a programmable computer, a method of con- 
structing a negative delta stream (58) during a 
merge of an orignai data stream (36) with a delta 
stream (34), the mettwd comprising ttie steps of: 20 

a) initiating a search request (102). wittun a 
current match and/or data delta frame of the 
delta stream (34), for a nurvt>er of data bytes to 
transfer to an updated data stream (52) ; zs 

b) if the current delta frame of the delta stream 
(34) is empty, loading (120) a next delta frame 
from the delta stream (34) as the current delta 
frame; 

c) monitoring (172) an original data stream 30 
position associated with the delta stream (34); 

d) after loading (1 20) a next delta frame which 
is a match frame, 

i) calculating (172) a prior stream lag count 35 
as the difference between the match 
frame's source match position and tlie 
delta stream's original data stream posi- 
tion; 

ii) transferring (194. 202) a nunt>er of 40 
bytes, equaling the prior stream lag count 
from the original data stream (36) to the 
negative delta stream (58), ni the form of a 
data frame; 

iii) aeating (176) an inverse of the match 4S 
frame; 

Iv) recorcfing (178) ttie inverse match frame 
in the negative delta stream (58); and 
V) inaementing (188, 208) ttie original 
data stream position associated witii ttie so 
delta stream (34) tiy a nunt}er equeA to ttie 
prior stream lag count; 

e) if the delta stream (34) is capatDle of supply- 
ing data bytes from a curent frame which is a ss 
data frame, fulfilling (110) the search request 
(102) witti data fcrytes provided from ttie current 
frame; 

f) if ttie delta stream is incapable of fulfilling 



(110) ttie search request (102), fulfilling (112) 
the search request (102) witti data bytes pro- 
vided by ttie ori^nal data stream; 

g) incrementing ttie original data stream posi- 
tion associated witti ttie delta stream (34) by 
the number of data bytes provided to the 
updated data stream (52) by the orignial data 
stream; and 

h) repeating steps a)-g) until a search request 
(102) carmt be fulfifled (110, 112), and ttie 
updated data stream (52) is complete (106). 

& In a programmable computer, a method of merging 
a sequential plurality of delta streams (30, 32, 34) to 
aeate a compiled delta stream (294). ttie mettiod 
oortpristng the steps of: 

a) initiating a search request (298), wittiin ttie 
sequential plurality of delta streams (30. 32, 
34). for a data frame to transliBr to the oorrpiled 
delta stream (294); 

b) fijIfiOing (306) ttie search request (298) witti 
a data frame provided by ttie last delta stream 
in the sequential plurality of delta streams (30, 
32, 34) which is capable of supplying a data 
frame; 

c) if the sequential plurality of delta streams 
(30. 32, 34) is incapable of fulfBling (306) ttie 
search request (298). constructing (140) a 
match frame to transfer (308) to ttie compiled 
delta stream (294); and 

d) repeating steps a)-c) until a search request 
(29Q cannot be fulfilled (306. 308). and ttie 
compiled delta stream (394) is corrplete (302). 

6. In a programmable computer, a mettiod of merging 
a sequential plurality of delta streams (30. 32. 34) to 
aeate a compled delta stream (294), the mettiod 
comprising the steps of : 

a) constructing a chain (38) of transaction ele- 
ments (24. 26. 28) corresponding to ttie 
sequential plurality of delta streams (30. 32. 
34). wherein a highest numb^ed transaction 
element (24) in ttie transaction chain (38) is 
associated witti a delta stream (30) represent- 
ing a latest revision to an original data stream 
(36), a lowest nimbered transaction element 
(28) in the transaction chain (38) is associated 
witti a delta stream (34) representing a first 
revision to the original data stream (36), and 
consecutively nunft>ered transaction elements 
(28. 26. 24) in ttie transaction chain (38) are 
associated witti sequential revisions to ttie orig- 
inal data stream (36): 

b) initiating a search request (298). within ttie 
transaction chain (38). for a data frame to 
transfer to the compiled delta stream (294); 

c) futfilling (306) ttie search raciest (298) witti 
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a data frame provided by the Nghest nurribered 
transaction element in the transaction chain 
(38) which is capable of st^lying a data 
frame: 

d) if the transaction elements (24. 26. 28) of the s 
transaction chain (38) are incapable of fulfilling 
(306) the search request (298). constructing 
(140) a match frame to transfer (308) to tfie 
compiled delta stream (294); and 

e) repeating steps a)-d) unta a search request io 
(298) cannot be fulfilled (306, 308), and the 
compiled delta stream (294) is complete (302). 

7. A computer programmed to merge a sequential plu- 
rality of delta streams (30. 32. 34) with an original is 
data stream (36). in a single pass, to create an 
updated data stream (52). the computer oompris-. 
ing: 

a) a transaction cfiain (38) comprising a plural- 20 
ity of sequenced transaction elements (24. 26, 
28). wtierein each transaction element (24. 26. 
28) has a delta stream input (42. 44. 46) corre- 
sponding to one of the sequential plurality of 
delta streams (30, 32, 34); and 25 

b) a consun^er process (22). connected to a 
trailing element (24) of the troisaction cfiain 
(38). and comprising an input (48) for the origi- 
nal data stream (36) and an output (50) for the 
updated data stream (52), so 

8. A computer programmed to meige a sequential plu- 
rality of delta streams (30. 32. 34). in a single pass, 
to aeate a oonpiled delta stream (294). the compu- 
ter comprising: 3S 

a) a transaction cfiain (38) comprising a plural- 
ity of sequenced transaction elements (24. 26, 
28). wherein each transaction element (24. 26. 
28) has a delta stream Biput corresponding to 40 
one of the sequential pturality off delta streams 
(30, 32. 34); and 

b) a compiler consumer process (286). con- 
nected to a trailing element (24) of the transac- 
tion chain (38), comprising an output for tfie 4S 
compiled delta stream (294). 

9. A physical storage media programmed to control a 
computer in merging a sequential plurality of delta 
streams (30. 32. 34) with an original data stream so 
(36). in a single pass, to aeate an updated data 
stream (52), the media comprising: 

a) means to produce a transaction chain (38) 
comprising a plurality of sequenced transaction ss 
elements (24. 26. 28), wherein each transac- 
tion element (24, 26. 28) has a delta stream 
input (42. 44. 46) corresponding to one of the 
sequential plirality of delta streanre (30. 32, 



34); 

b) a consumer process (22) comprising an 
input (48) for the original data stream (3Q and 
an output (50) for the updated data stream 
(52); and 

c) means to coruiect the consumer process 
(22) to a trailing element (24) of the transaction 
chain (38). 

10. A physical storage media programmed to control a 
computer ffi merging a sequential plurality of delta 
streams (30. 32, 34). in a single pass, to create a 
compiled defta stream (294), the mecKa c o mpri si ng: 

a) means to produce a transaction chain (38) 
comprising a plurality of sequenced transaction 
elements (24. 26. 28). wherein each transac- 
tion element (24. 26. 28) has a defta stream 
input correspondong to one off the sequential 
pluredity of delta streams (30. 32. 34); 

b) a compiler consumer process (286) com- 
prising an output for ttte oonpOed delta stream 
(294): and 

c) means to connect the compiler consumer 
process (286) to a trailing element (24) of the 
transaction chain 
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